Layered Interpretation of Street View Images

نویسندگان

  • Ming-Yu Liu
  • Shuoxin Lin
  • Srikumar Ramalingam
  • Oncel Tuzel
چکیده

We propose a layered street view model to encode both depth and semantic information on street view images for autonomous driving. Recently, stixels, stix-mantics, and tiered scene labeling methods have been proposed to model street view images. We propose a 4-layer street view model, a compact representation over the recently proposed stix-mantics model. Our layers encode semantic classes like ground, pedestrians, vehicles, buildings, and sky in addition to the depths. The only input to our algorithm is a pair of stereo images. We use a deep neural network to extract the appearance features for semantic classes. We use a simple and an efficient inference algorithm to jointly estimate both semantic classes and layered depth values. Our method outperforms other competing approaches in Daimler urban scene segmentation dataset. Our algorithm is massively parallelizable, allowing a GPU implementation with a processing speed about 9 fps.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

End-to-End Interpretation of the French Street Name Signs Dataset

We introduce the French Street Name Signs (FSNS) Dataset consisting of more than a million images of street name signs cropped from Google Street View images of France. Each image contains several views of the same street name sign. Every image has normalized, title case folded ground-truth text as it would appear on a map. We believe that the FSNS dataset is large and complex enough to train a...

متن کامل

DIFFUSE CONTRAST ENHANCEMENT ON MR IMAGES IN BRAIN INFARCTION: \"PSEUDOTUMOR SIGN\"

The purpose of this study was to describe the pattern of diffuse enhancement seen on contrast-enhanced MR images in patients with subacute infarction. A retrospective study of 104 patients with the diagnosis of stroke who had undergone contrast-enhanced MR scanning within2 weeks of the inciting neurological event revealed 66 patients who demonstrated different patterns of contrast-enhanceme...

متن کامل

LOX Framework: Designing Human Computation Games to Update Street Views

Although the Web has abundant information, it does not necessarily contain the latest, most recently updated information. In particular, interactive map websites and the accompanying street view applications often have outdated information because street views change constantly and are very costly to update. In this work, we propose the LOX (Labeling and O/X) framework – a scalable human comput...

متن کامل

Cataloging Public Objects Using Aerial and Street-Level Images - Urban Trees

In this section we provide the form of the projection functions Pv(`, c) that convert from geographic locations to pixel locations in aerial view and street view images. We give the form of the inverse function P−1 v (`′, c) that converts from pixel locations to geographic coordinates. Aerial images: Aerial view imagery in Google maps is represented using a Web Mercator projection, a type of cy...

متن کامل

I-45: Important Points in Interpretation of Sonographic Images of Female Pelvis (Imaging Case Review)

Ultrasonography represents the method of choice in the investigation of the female pelvis. An accurate interpretation of the images must take into consideration the specific features of the uterus, ovaries and fallopian tubes. The present case review aims to demonstrate important points in interpretation and management of the female pelvis images.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1506.04723  شماره 

صفحات  -

تاریخ انتشار 2015